RONCHI AND PERONA: DESCRIBING COMMON HUMAN VISUAL ACTIONS IN IMAGES 1 Describing Common Human Visual Actions in Images

نویسندگان

  • Matteo Ruggero Ronchi
  • Pietro Perona
چکیده

Which common human actions and interactions are recognizable in monocular still images? Which involve objects and/or other people? How many is a person performing at a time? We address these questions by exploring the actions and interactions that are detectable in the images of the MS COCO dataset. We make two main contributions. First, a list of 140 common ‘visual actions’, obtained by analyzing the largest on-line verb lexicon currently available for English (VerbNet) and human sentences used to describe images in MS COCO. Second, a complete set of annotations for those ‘visual actions’, composed of subject-object and associated verb, which we call COCO-a (a for ‘actions’). COCO-a is larger than existing action datasets in terms of number instances of actions, and is unique because it is data-driven, rather than experimenter-biased. Other unique features are that it is exhaustive, and that all subjects and objects are localized. A statistical analysis of the accuracy of our annotations and of each action, interaction and subject-object combination is provided.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Describing Common Human Visual Actions in Images

Which common human actions and interactions are recognizable in monocular still images? Which involve objects and/or other people? How many is a person performing at a time? We address these questions by exploring the actions and interactions that are detectable in the images of the MS COCO dataset. We make two main contributions. First, a list of 140 common ‘visual actions’, obtained by analyz...

متن کامل

Distance estimation of an unknown person from a single portrait

This document accompanies the paper “Distance estimation of an unknown person from a single portrait”. We provide further insight on some of the parameter/method choices made in the main paper and additional physiognomy interpretation of the results.

متن کامل

Quantum of Vision

All rights reserved iii ACKNOWLEDGEMENTS I enjoy doing research that is different. Being different means not inheriting the definitions and solutions of existing problems but striving to recognize and address new problems. This notion of being different is imparted to me by my advisor Prof. Pietro Perona, to whom I am deeply grateful. I have consulted Pietro for basically everything: the meanin...

متن کامل

Distance Estimation of an Unknown Person from a Portrait

We propose the first automated method for estimating distance from frontal pictures of unknown faces. Camera calibration is not necessary, nor is the reconstruction of a 3D representation of the shape of the head. Our method is based on estimating automatically the position of face and head landmarks in the image, and then using a regressor to estimate distance from such measurements. We collec...

متن کامل

Effectiveness of the Baby Friendly Community Initiative in Italy: a non-randomised controlled study

OBJECTIVE To assess the effectiveness of the Baby Friendly Community Initiative (BFCI) on exclusive breast feeding at 6 months. DESIGN Controlled, non-randomised trial. SETTING 18 Local Health Authorities in 9 regions of Italy. PARTICIPANTS 5094 mother/infant dyads in 3 cohorts were followed up to 12 months after birth in 3 rounds of data collection: at baseline, after implementation of t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015